Roughfication of Numeric Decision Tables: The Case Study of Gene Expression Data
نویسندگان
چکیده
We extend the standard rough set-based approach to be able to deal with huge amounts of numeric attributes versus small amount of available objects. We transform the training data using a novel way of non-parametric discretization, called roughfication (in contrast to fuzzification known from fuzzy logic). Given roughfied data, we apply standard rough set attribute reduction and then classify the testing data by voting among the obtained decision rules. Roughfication enables to search for reducts and rules in the tables with the original number of attributes and far larger number of objects. It does not require expert knowledge or any kind of parameter tuning or learning. We illustrate it by the analysis of the gene expression data, where the number of genes (attributes) is enormously large with respect to the number of experiments (objects).
منابع مشابه
Presenting a new equation for estimation of daily coefficient of evaporation pan using Gene Expression Programming and comparing it with experimental methods (Case Study: Birjand Plain)
One of the most important componenets of water management in farms is estimating crops’ exact amount of evapotranspiration (water need). The FAO-Penman-Montheis (FPM) method is a standard method to evaluate other techniques which are used for easy calculation of potential evapotranspiration, when lysimeter datasheets are not available. This study was carried out based on 18 years’ climatic dat...
متن کاملPrediction of Acid Mine Drainage Generation Potential of A Copper Mine Tailings Using Gene Expression Programming-A Case Study
This work presents a quantitative predicting likely acid mine drainage (AMD) generation process throughout tailing particles resulting from the Sarcheshmeh copper mine in the south of Iran. Indeed, four predictive relationships for the remaining pyrite fraction, remaining chalcopyrite fraction, sulfate concentration, and pH have been suggested by applying the gene expression programming (GEP) a...
متن کاملMutations of p53 Gene in Skin Cancers: a Case Control Study
Background: The most frequently mutated tumor suppressor gene found in human cancer is p53. In a normal situation, p53 is activated upon the induction of DNA damage to either arrest the cell cycle or to induce apoptosis. However, when mutated, p53 is no longer able to properly accomplish these functions. The aim of this study was to investigate the expression of p53 gene in cases of skin cancer...
متن کاملInvestigating the Relation between LCK Gene Expression with Type 2 Diabetes Patients in Yazd Diabetes Research Center
Type 2 diabetes mellitus (T2DM) is characterized by insulin resistance and insulin secretory defect. Deficiency of cellular immunity is known as one of the factors involved in the pathogenesis of T2DM. lymphocyte-specific protein tyrosine kinase( LCK) is an important gene involved in the intracellular signaling pathways of lymphocytes. This study aimed at determining and comparing LCK gene expr...
متن کاملAltered expression of Lnc-OC1 and SIRT1 genes in colorectal cancer tissue
Backgrounds: SIRT1 plays an important role in many physiological processes, including metabolism, neuronal protection, senecence and inflammatory, by staging histones and multiple transcription factors. However, the complex mechanisms of SIRT1 signaling in tumors are not yet fully understood, as it acts as both an oncogen and a tumor suppressor. On the other hand, it has been shown that the Lnc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007